Area-Efficient Evaluation of Arithmetic Expressions Using Deeply Pipelined Floating-Point Cores
نویسندگان
چکیده
Due to technological advances, it has become possible to implement floating-point cores on FPGAs in an effort to provide hardware acceleration for the myriad applications that require high performance floating-point arithmetic. However, in order to achieve a high clock rate, these floating-point cores must be deeply pipelined. Due to this deep pipelining and the complexity of floating-point arithmetic, floating-point cores use a great deal of the FPGA’s area. It is thus important to use as few floating-point cores in an architecture as possible. However, the deep pipelining makes it difficult to reuse the same floatingpoint core for a series of floating-point computations that are dependent upon one another. In this paper, we describe an area-efficient architecture and algorithm for the evaluation of arithmetic expressions. This design effectively hides the pipeline latency of the floating-point cores and uses only one floating-point core for each type of operator in the expression. The design is applicable to a wide variety of fields such as scientific computing, cognition, and graph theory. We analyze the performance of this design when implemented on a Xilinx Virtex-II Pro FPGA. Focus Session—Reconfigurable Supercomputing
منابع مشابه
Area-Efficient Evaluation of a Class of Arithmetic Expressions Using Deeply Pipelined Floating-Point Cores
Due to technological advances, it has become possible to implement floating-point cores on FPGAs in an effort to provide hardware acceleration for the myriad applications that require high performance floating-point arithmetic. However, in order to achieve a high clock rate, these floating-point cores must be deeply pipelined. Due to this deep pipelining and the complexity of floating-point ari...
متن کاملDesigning Of Pipelined Architecture of Arithmetic Core and Analysis of Area and Timing Performance
The aim of this paper is designing of pipelined architecture of arithmetic core and analysis of area and timing performance of that arithmetic core consisting of fixed point as well as floating point arithmetic cores. The basic concept behind designing such a core is to optimally utilize the algorithms of fixed point as well as floating point arithmetic operations, i.e., addition, subtraction d...
متن کاملFPGA-specific synthesis of loop-nests with pipelined computational cores
The increased capacity and enhanced features of modern FPGAs opens new opportunities for their use as application accelerators. However, for FPGAs to be accepted as mainstream acceleration solutions, long design cycles must be shortened by using high-level synthesis tools in the design process. Current HLS tools targeting FPGAs have several limitations including the inefficient use of deeply pi...
متن کاملComparison of pipelined IEEE-754 standard floating point adder with unpipelined adder
Many Digital Signal Processing (DSP) algorithms use floating-point arithmetic, which requires millions of calculations per second to be performed. For such stringent requirements, design of fast, precise and efficient circuits is the goal of every VLSI designer. This paper presents a comparison of pipelined floating-point adder complaint with IEEE 754 format with an unpipelined adder also compl...
متن کاملVariable Precision Floating-Point Divide and Square Root for Efficient FPGA Implementation of Image and Signal Processing Algorithms
Field Programmable Gate Arrays (FPGAs) are frequently used to accelerate signal and image processing algorithms due to their flexibility, relatively low cost, high performance and fast time to market. For those applications where the data has large dynamic range, floating-point arithmetic is desirable due to the inherent limitations of fixed-point arithmetic. Moreover, optimal reconfigurable ha...
متن کامل